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• . with a pertinent probability Pi , then the average path (defined as the weighted average positions 
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Introduction 



Physics is fundamentally divided in two parts. Mechanics, which includes classical, relativistic 
and quantum mechanics, is ruled by the least-action principle (LAP) and characterized by laws 
invariant by time reversal. Thermodynamics, on the other hand, can be entirely deduced from 
the maximum entropy principle (Maxent), and introduces a so-called "arrow of time". 
LAP is equivalent to Newton's law in classical mechanics, and can be used in relativistic and 
quantum mechanics with an adequate reformulation of the Lagrangian. It has been historically 
applied, from Fermat to Feynmann, with such success that today it is a fundamental principle of 
physics. It has never been, to our knowledge, deduced from more fundamental principles. On the 
other hand, Maxent is justified by the Bayesian rules of probability, and simply gives the most 
probable probability distribution according to given constraints. For instance, in equilibrium ther- 
modynamics, Maxent proves that the average values of the extensive quantities used to describe 
the system (energy, volume, ...) are appropriate constraints. For an excellent summary of Maxent 
properties and their applications to thermodynamics, see [T|. Since Maxent is a principle of logic, 
it can and has been successfully applied in many different fields, for example biology, economy or 
ecology [3]. 

A mechanical system is defined by the values of a set of generalized coordinates at a given time. 
We make the hypothesis that these values, including time, are only known in average. Applying 
Maxent, we will logically deduce that the system will follow the Lagrangian laws of motion. LAP 
is a consequence of Maxent with appropriate constraints. 

In the first section, a least information principle will be deduced from Maxent. This section does 
not involve physics and is entirely mathematical. 

In the second section, the least information principle will be applied to a mechanical system whose 
space and time coordinates are only known in average. The least information principle will appear 
to be, up to a multiplicative constant with dimension Action, the least action principle. 
A third section will be devoted to possible physical interpretations of this result, and to new 
questions that then naturally emerge. 

1 The least information principle 

In this section, a least information principle, similar in form to the least action principle, will be 
deduced from Maxent applied with linear constraints. Provided that time is one of the constrained 
quantities, the rate of information will satisfy the equations of Cauchy-Riemann. 

1 . 1 The proof 

Notations: we write a sequence (sj) instead of (si, Si, Sj max )- Two sequences using the same 

3 max 

name of indice should have the same length. We write ^2 instead of ^ . A sequence using two 

i i=l 

indices is written (s,- ,■) instead of (si i s-i , , s,- 1 , .... s,- , ,~-,Sj ,• ). 



Some numbers (Ajy) are given once and for all, and will always be considered fixed. Some other 
numbers (Ak) are given and we wish to find the probability law (p^ satisfying the constraints: 



YsPiA^i = A k for all k 

i 

Following Jaynes, we will choose the probability law which maximizes the information: 
I = - ^pilnpi 

i 

Using Lagrange multipliers (a^) (see demonstration in Annex 1), we obtain the standard results: 
dI = J2 a kdA k (2) 

k 

tarll forallhJ (3) 

Adding another constraint on a new variable t that the reader can conveniently consider as time, 
the system (TjQ) then becomes: 

Y;PiAk,i = A k for all k 

i 

Y,Piti = t (4) 

E» = i 

i 

Calling (3 the Lagrange multiplier of t, the results ([2D and © become: 

dl = ^ Oi k dA k + f3dt (5) 



|a = |2l for all k, I 

dA t 8A k J ' 

da fc 8(3 

dt ~ 8A k 



(6) 



The notation A^ is an abbreviation for the exact differential ^4^. Equation §5§ can be written 



dl = ^2 a k A k + f^j dt 



Let 



L((A k ),(A k ),t) = J2^ k A k + P (7) 

k 

We have: 

dl = Ldt (8) 

We now prove our main result, which is that L satisfies the equations of Cauchy-Riemann. 
Since = (but note that jjj^ may be ^ 0), we have from equation (J7j) 

dL -s-y dai ■ d(3 

dA k = ^dA k l + dA k (9) 



i 



The (a k ) are only functions of the (A k ) and t, — ^- = —9- = for all fc, /. So we have: 

dL . . 

a fc (10) 



and 

eft a^i. ~ dt ~ QA t 1 dt [ ' 

As a result of 

d~t^A k = ^dT k l + dA~ k 
and, using (jHJ), 1/ satisfies the equations of Cauchy-Riemann: 

l^L = f „ all k (12) 

d * aA fc dA k 

As a consequence, the information 

h 

j = y L(t)dt (13) 

to 

is stationary. For any variations SA k (t) such that 5A k (to) = 5A k (ti) = for all k, we have 51 = 0. 

By analogy with LAP, this consequence will be subsequently referred as the "least information 
principle" . 



1.2 Conservation laws 



From equations (TTTT) and (|T2|) . we obtain 

One can also prove that (see demonstration in Annex 2) 

^ - Q ± (15) 
dt ~ dt 

If L does not depend explicitly on a given A}~, then the conjugate quantity ctk does not vary with 
t. If L does not depend explicitly on time t, then the quantity (3 does not vary with t. 

1.3 Degree of validity 

Note that in this problem, there are no assumptions about the nature of the (A^). In particular: 

- The (Ak) do not have to be frequency averages of the values [A^^) in an experiment. Neither 
do we need a notion of ensemble. We want to find a probability law which reflects our state of 
knowledge, not a property of some system. 

- The (Ak) do not have to be extensive quantities (a thermodynamical concept not necessary for 
Maxent), neither do they need to scale together or have any other relationship. 



2 Physical application 

Let us consider a mechanical system whose state is defined by a set of independent coordinates 
(qk). The motion of such a system can be described by a parameterized curve (gfc(A), t(A)) in the 
{(qk),t) space. We call such a curve a path. The system can potentially take many different paths 
from a given starting position (gfc(O), t(0)). Let us denote by (i) the set of all these paths, and 
adopt, without loss of generality, a common parameter A to describe all these paths. A given path 
i is then described by the k + 1 functions ((<?fc,i(A)), ij(A)). 

We make the hypothesis that the observed path ((qfc(A)), t(X)) is the average path of all the paths 
i, each one occurring with a probability p^. Mathematically: 



X>A(A)=t(A) 



for all k 



for all A 



We now fix the parameter A, and no longer write the dependence on A. The Maxent distribution 
which satisfies the preceding constraints is the solution of system with: 



Qk,i — ^k,i 

Qk = A k 

Equations (171). (IBj). (111)1 and (115]) become: 



fc 

= Y J a k dq k + (3dt = Ldt 

k (16) 

^ = £ /or oil k 

d§_ _ dL 
dt dt 

We recognize the equations of Lagrangian mechanics: L being the Lagrangian, the (a k ) the 
generalized momentum, (3 the opposite of the Hamiltonian and / the action. But this can not 
be correct because, for instance, the dimension of action is Action = Energy x Time while I is 
dimensionless. However, K being an appropriate constant of dimension Action, we recover all 
lagrangian mechanics with the following identifications: 

K I = S action 

K a k = p k generalized momentum 

K f3 = —H — Hamiltonian 

K L = £ Lagrangian 

which give 



k 

dS = ^2 p k dq k — Hdt = Ldt 

k 

dp k _ PC 
dt dqk 
d(-H) = dC 

dt dt 



(17) 

for all k 



The least information principle becomes the least action principle. The action 
S = JC{t)dt 

to 

is stationary. For any variations SA k (t) such that 5A k (to) = 5A k (ti) = for all k, we have 5S = 0. 
Equations (|T4l) and (1T51) become: 

^ = ¥- for all k 

dt dq k J 
d(-H) _ dC 



dt dt 

which is Noether's theoremJO]. The generalized momentum is conserved if C does not explicitly 
depend on the associated generalized coordinate. Energy is conserved if C does not explicitly 
depend on time. 



3 Comments and open questions 



The relation between the Lagrangian C and the Hamiltonian H appears naturally in the Maxent 
formalism. However, while one is the Legendre transform of the other, they do not play roles 
similar to / and ln(Z). In fact, the partition function Z does not play any particular role in our 
description (the properties of ln(Z) mirror the properties of IpQ), and it is the presence of time 
which induces the relation between L and H. 

One can also note that, fundamentally, C satisfies the equations of Cauchy-Riemann because dl 
is an exact differential. 

In the case of conservative forces, LAP is equivalent to Newton's law. This means that Laplace's 
equations could have emerge without any hint of Newton, if the work of Jaynes (1922-1998) had 
been known. Maxent as a fundamental physical principle certainly has epistemological implica- 
tions. 

For a given physical path, there is a constant K with dimension Action such that 

K I = S action 

K ah = Pk generalized momentum 

K j3 = —H — Hamiltonian 

K L = C Lagrangian 

But nothing a priori prevents the value of K to be different in different experiments. One can 
not calculate the values of I and of the (atk) for a given physical path, since the (qk,i) are a priori 
unknown. 

However, the (qk,i) could eventually be known using quantum mechanics (QM). To simplify, let us 
state that there is one single coordinate x. In QM, a system is defined by its wave function, which 
is a function of space and time <p(x,t). We can identify a path i of probability pi with the set of 
all (x,t) such that \ip(x,t)\ 2 = pi (of course, to be rigorous, the discrete set of paths (i) first has 
to be replaced by a continuous set). Since the paths % are known, the (x,(A)) and (ti(A)) are also 
known if we can parameterize all these paths with a common parameter A. Deducing the value of 
K rests an open question. 

Feynmann's path integral formulation of QMj3] offers similarities with our description. However, 
the two theories also present fundamental differences. In particular, Feynmann assigns equal 
probabilities to all paths, and does not average the action S, but the quantity e lS l h . A possible 
link has to be investigated. 

An analogy between mechanics and thermodynamics has already been found[5j. It uses the for- 
malism of differential forms, but the main results can be obtain using Maxent. This analogy 
comes fundamentally from the fact that Lagrangian mechanics and equilibrium thermodynamics 
can both be described by a set of linear constraints as has been shown in this paper. 



4 Conclusion 



The least action principle (LAP) is a consequence of Maxent, provided that the constraints concern 
the average coordinates and the average time of a mechanical system. The simplicity of the 
demonstration and the high degree of generality of Maxent explain why LAP is so general in 
mechanics. 

This demonstration of LAP sheds a new light on the relationship between thermodynamics and 
mechanics. It offers an opportunity to unify these two branches of physics, with Maxent as a 
common basis. 



Annex 1 



This annex demonstrates classical results about Maxent distributions [7j. 

Some numbers (A k ^ are given once and for all, and will always be considered fixed. Some other 
numbers (A k ) are given and we wish to find the probability law (p^) satisfying the constraints: 

Y^PiA kti = A k for all k 

e* = i (18) 

i 

The Maxent principle consists of choosing the distribution (pA which maximizes the information 
I = -^ Pi ln(pi) 

i 

Using the method of Lagrange multipliers, let 
£ = I - ^2a k ^2piA kji -yy~]pj 

hi i 

where (ak) and 7 are new variables called the Lagrange multipliers. 
The distribution (p t ) should satisfy 



d£_ 

dpi 



= - \n(pi) - 1 - 2J a k A k)i - 7 for all i 



Calling 



Z t = e * 



i 

we obtain 



(19) 



z 

and 



- ln(pi) = ^ a kA k ,i + ln(Z) 

k 

Therefore 



Differentiating ( TL9l) . we obtain: 
-AkiZi = —Ak,iPiZ 



da k 



da k ^ da k 



that we write: 

ain(Z) 

da k 



-A k for all k (21) 



The (ak) can be found by solving the /c max equations of this last system. ln(Z) is an exact 
differential and can be written: 



d(\n(Z)) = -Y,A k da k 

k 

Using equation (1201) . we can now find an expression for dl. Since the quantities (Ak) are indepen- 
dent, = S k> i (Sk,i is Kronecker symbol) and: 

dl sr-^ da k d\nZ 

Since Z is a function of the a^, 

dlnZ <s—y d\nZ da k 
dAl = l-;da k ~dA l 

k 

and using (12"I1) : 

dliaZ ^-a 9afc 

We finally obtain: 

= a fc for all & 

/ is a function of the (A^): 



<i/ = a k dA k 

k 



dl is an exact differential: 

d 2 S d 2 S 



dA k 8A l dAidA k 

therefore: 

da k dan , „ 
OA, dA k 



Annex 2 



We have, by definition of L (equation ((7|)): 
(3 = L - 22 a kM 

k 

So 



when 



Since 




taking into account (TjTJJ and fflUj) . we obtain 

d£_dL 
~dt ~ ~dt 
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